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Confidence  Envelopes  for  Monotonie  Functions: 

Principles,  Derivations,  and  Examples. 

I.  Curve  Envelopes 

In  seminal  work  published  in  1929,  Working  and  Hotelling  developed  the  formulae  for  the 
extrapolation  of  the  concept  of  the  confidence  interval  to  regression  lines,  to  provide  a 
"confidence  envelope"  enclosing  the  population  line  with  probability  1  -  ® .  Such  an 
envelope  for  a  regression  fuhction  F  (  x;  J9  )  in  which  0  is  a  parameter  vector  has  also 
beencalled  a  confidence  band  (by  Miller,  1966).  It  is  defined  by  functions  L  ( x;  6  )  and  U 
(  x;  e  )  (illustrated  in  Figure  1)  satisfying  the  relationship 

p{u(xj0  -  F(x»  e  )  *  L (  x >J3  ),  V  x)  -  1  -  o 

The  envelope  problem  for  linear  models  has  been  treated  extensively  by  Working  and 
Hotelling  (1929),  Roy  (1957),  Miller  (1966),  and  others;  but  all  use  the  intimate 
interrelationships  of  linear  models  with  multinormal  error  to  develop  computational 

A  A 

formulae  for  U  (x;  j)  )  and  L  (  x;  0.  ).  This  paper  describes  an  algorithm  for  computing 
confidence  envelopes  satisfying  the  above  equation  for  any  function  P  (  x;  Q  )  which  is 
monotonic  in  its  parameters.  This  technique  will  be  applied  to  linear  regression,  to  obtain 
the  classical  result;  then  illustrations  will  be  provided  using  nonlinear  regression. 

It  is  assumed  that  there  are  parameter  estimates  Q  for  which  the  sampling  distribution 

A 

is  multivariate  normal  with  mean  0  and  covariance  matrix  Z  .  Only  multi-parameter 
curves  will  be  considered;  for  one-parameter  curves,  which  are  conditionally  monotonic 
(in  the  sense  defined  below)  the  problem  is  quite  simple.  In  the  multi-parameter  situation, 
a  central  (  1  -  a  )-confidence  region  for  the  parameters,  called  the  (  1  -  a  )-Highest 
Density  Region  (HDR)  by  Novick  and  Jackson  (1977),  is  easy  to  describe:  it  is  defined  by 
the  ellipsoid  containing  100  (  1  -  a )  %  of  the  posterior  density,  where  all  of  the  points 
within  the  ellipsoid  have  higher  density  than  all  of  the  points  outside  the  ellipsoid.  The 
two-dimensional  case  is  illustrated  in  Figure  2. 

The  HDR  for  the  general  multivariate  normal  density  is  defined  most  easily  in  terms  of 
that  for  the  "standard"  multinormal  distribution:  z  ^  N  (  O,  I  ).  The  HDR  for  the 
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k -dimensional  standard  multinormal  is  defined  by  the  hypersphere  (or  circle,  for  the  two- 
dimensional  illustration  in  Figure  3)  with  radius  equal  to  ijx2  (  j  _  a  >  (  k  ),  where 
X2^|.  j  ( k )  is  the  upper  100  ( 1  -  a)  %  critical  value  for  the  X*  distribution  with  k 
degrees  of  freedom.  Since  any  multinormal  variables  may  be  or thonor realized  by  the 
transform  z  -  C  ( 0  -  ),  general  multinormal  6*  may  be  defined  as 

e*  =  cz*  +  e  (1) 

(in  which  C  is  the  Cholesky  factor  of  E  such  that  £  =  CC'  ).  Thus,  the  HDR  for  the 
general  multinormal  in  Figure  2  is  an  ellipsoid  defined  by  equation  (1)  and  the  condition 

z*' z*  *  X2(  l  _  a)0<). 

Since  the  radius  of  this  hypersphere  (or  circle  in  the  two  dimensions  of  Figure  3)  is  fixed, 
the  points  z*  may  be  defined  conveniently  in  hyperspherical  (or  circular)  coordinates  as 
functions  exclusively  of  the  angle  £  . 

Each  point  in  the  parameter  space  defines  a  fitted  line;  the  lines  defined  by  the  parameter 
values  within  the  HDR  in  Figure  2  are  the  lines  which  must  lie  within  the  (  1  -  a  )- 
envelope  in  Figure  1.  Therefore,  the  problem  of  computing  the  boundaries  of  the  ( 1  -  a  )- 
envelope  becomes  that  of  computing,  for  each  value  of  x,  the  value  of  L  (  x;  0. )  which  is 
the  minimum  F  (  x;  £  )  for  any  9.  in  the  HDR,  and  U  (  x;  £  )  which  is  the  maximum 
F  (  x;  6  )  in  the  HDR. 

If  the  function  F  (  x;  £  )  is  "conditionally  monotonic"  in  its  parameters,  where 
"conditional  monotonicity"  means  that  for  parameter  ^  f 


or 


F(x;9i  |e  )  <  p (x,0 i  +  «|ejfli) 

for  all  positive  6 ,  then  the  minimum  and  maximum  values  of  F  (  x;  9.  )  for  the  1  within 
the  HDR  are  always  on  the  boundary  of  the  HDR.  Thus,  computing  L  (  x;  £  )  and 
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U  (  x;  0  )  is  a  matter  of  locating  the  points  e  *  on  the  ellipsoid  in  Figure  2  which 
minimize  and  maximize  F  (  x;  0  )  for  values  of  x.  The  points  e*  are  all  specified  by 
angles  in  z-space  and  the  transformation  in  equation  (1). 

Thus,  to  compute  values  of  the  function  L  (  x;  6  )  for  a  series  of  values  of  x,  and  to  make 
plots  as  in  Figure  1,  one  minimizes  for  each  value  xQ: 

F  (  xo;  0  )  =  F  {  xc;  G  [  H  (  $  5X2  (  x  _  a  )  (  k  ) );  e  ,  £  j  } 

=  F  (  x0 ;  )»  f°r  short, 

in  which 

A  A 

J3  ~  G(z;^,£)  *  Cz  +  j) 

and 

2  =  HU  j  x2(1.a  )  (  k  ) ) 

represent  the  conversion  of  circular  or  hyperspherical  coordinates 
[  <J> ' »  r  =  2(  1  -  a  )  (  k  )]  into  cartesian  coordinates  z.  Minimization  is  in  <j>-space, 

with  dimensionality  k  -  1.  Consequently,  for  a  two-dimensional  problem,  one  solves  for 
the  scalar  angle  <J>  in  Figure  3  which  gives  the  values  of  9  *  in  Figure  2  which  minimize 

/v 

F  (  x  ;  _0  )  in  Figure  1;  that  is  the  value  of  L  (  x  ;  0  ).  After  repeating  this  process 

V  A  ^  - 

frequently  enough  to  plot  L  (  xQ;  @  ),  one  maximizes  F  (  xQ;  0  )  over  o  *  to  obtain 
u  (  x05  8  ). 


The  Linear  Case 

The  derivation  of  the  envelope  for  the  linear  case  is  more  complex  than  the  usual 
arguments  (as  in  Miller,  1966);  however,  generalization  of  this  argument  to  the  nonlinear 
case  is  straightforward.  One  specifies  that 

F  (  Xi  0  )  =  0  X  +  9  (2) 

1  2 
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is  a  linear  function  in  which  o  =  8 1 ,  82  are  the  slope  and  intercept  parameters;  for 
convenience,  the  mean  of  the  (fixed)  x -values  is  zero  and  their  variance  is  1.  One  assumes 
that  8  represents  the  maximum  likelihood  estimate  (MLE)  for  0  from  a  large  sample  of  n 
observations.  The  sampling  distribution  for  _0  is  approximately  N  (  6  ,  E  )  in  which 

1  =  (o2/n)l 

is  the  negative  inverse  expected  value  of  the  matrix  of  second  derivatives  of  the 
loglikelihood,  and  o 2  is  the  error  variance.  Thus, 


C  =  (0/  -/n)I 


L  (  x;^  )  is  obtained  by  minimizing  (2)  for  each  value  of  x  over  values  of  so  that 


8  =  Cz*  +  8 


(3) 


in  which  the  z*  satisfy  the  condition 


The  value  of  h  =  95  )  (  2  )  is  -^6  *2.45,  to  give  a  95%-envelope.  The  condition 

specified  by  (4)  is  met  by  z*  obtained  as 


z*'  =  [  h  cos  $  h  sin  $1  (5) 

/\ 

The  lower  bound  of  the  95%-envelope  L  (  x;  j8  )  is  given  by  the  minimum  of  the  linear 
function  (2)  with  the  constraint  that  0  satisfies  (3)  in  which  z*  satisfies  (5).  The 
minimization  is  in  $-space. 


The  required  minimum  is  obtained  where  the  derivative  of  (2)  with  respect  to  4>  is  zero; 
that  is 


3  F  /  3<|>  =  +  882/3$>  =  0 


(6) 


n 


♦  s  tan  _1  ( 1  /  X)  (8) 

Solutions  of  4>  between  O®  and  180°  give  maximum;  solutions  between  180°  and  360° 
give  minimum.  Such  a  closed-form  result  is  simpler  than  those  to  be  obtained  in 
subsequent  sections  on  nonlinear  models.  To  plot  L  (  x;  £  )  compute  <t>  using  (8),  z*  as  in 
(5),  o  as  in  (3)  and  F  from  (2)  and  plot  the  corresponding  hyperbola. 

Or,  to  obtain  the  customary  form  given  by  Working  and  Hotelling  (192F)  and  Miller  (1966), 
note  that  for  $  satisfying  (8) 

xcos<J>  +sin$  =  +  ti  +  <21  1/2  (9) 

plus  for  maximum  and  negative  for  minimum. 


12 


r*bc&>afl»3r^ftl^•,' 


(Geometric  proof  of  (9)  is  obvious  using  any  triangle  satisfying  (8).)  Using  the  identity  in 
(9)  and  substituting  (3)  in  (2)  (keeping  the  terms  separate),  one  obtains 

L(x;0)  =»  0xx  +  02  -ho/YrT  ll+x2J1/2  (10) 

where  the  first  two  terms  are  from  6  in  (3)  and  the  last  term  is  the  product  of  Cz*  and  x. 
This  is  the  usual  form  for  the  lower  bound,  as  in  Miller  (1966,  p.  Ill),  with  h  (the  root  of 
the  large-sample  x 2 )  replacing  -/2F  and  x  standardized.  Thus,  for  the  linear  case,  the 
trigonometry  implied  by  4>  is  computationally  optional;  it  serves  only  if  it  is  desirable  to 
know  the  parameter  values  associated  with  some  point  on  the  lower  bound.  The 
trigonometry  will  be  required  in  the  nonlinear  case,  however.  The  upper  bound  simply 
reverses  the  negative  sign  in  (10). 

A  Simple  Nonlinear  Example 

In  the  case  of  logistic  regression  of  a  binary  response  on  some  fixed  x,  F  (  x;  e  )  is  the 
logistic  function 

F  (  x;  9 ) «  1  /  11  + exp  (-u)J  , 
u=  9i (x - 02 ) 

in  which  0  v  is  the  logit-slope  and  0  2  is  the  location  parameter,  estimated  so  that  the 

A 

sampling  distribution  of  j)  is  N  (  _0  ,  II 2  =  CC'  ).  F  (  x;  0,  )  is  a  minimum  wherever  u  is 
minimized,  and  F  (  x;  9  )  is  maximized  wherever  u  is  a  maximum.  To  minimize 
(maximize)  F,  minimize  (maximize)  u.  Only  minimization  will  be  considered  to  locate 

A  A 

L  (  x;  0  );  to  maximize  for  U  (  x;  ),  reverse  the  signs.  Minimize  u  with  respect  to  ♦. 
The  minimum  of  u  for  a  particular  xQ  is  obtained  where  the  derivative 


Since 


3u/3$  *=  (x- 9  2  )  3  9  i  /  3<})  -  9i30  2  /34*  ■  0, 


0  *  Cz  +  0 


30/34>  *  C3z/30, 


z  -  [  h  cos  4>  h  sin  <t>  1 ,  and 
9z' /9  <t>  =  [-hsin<l>  h cos  <M 

in  which  h»^x2  ( i  -  a  )(2 ),  the  derivative  in  (li)  represents  a  straightforward 
computation.  ilnllce  the  linear  example,  there  is  no  closed-form  solution  in  this  case,  but 
with  the  second  derivative 

3  2  u/3$  2  -  (x  -  02  )  9  2  0i  /3$  2  -  230i/9$  302/9$  -  0i9202/9  <|>2 

the  function  u  is  easily  minimized  (over  $  )  by  Newton-Raphson  for  a  set  of  values  of  x, 
and  the  results  may  be  plotted  as  L  ( x;  0. ). 

A  More  Complex  Example 

In  a  more  complex  sort  of  logistic  regression  sometimes  used  in  mental  test  theory,  a 
lower  asymptote  0  3  is  also  estimated, 

F  (  x;  0  )=  0  3  +  ( 1  -  83  )  { 1  /  11  +  exp(  -u  )  1 ) 

=  0  3+  (  1  -  0  3  )P,  and 

U  =  01  (x-  0  2) 

and,  for  notational  convenience, 

Q  -  1  -  P 

where  P  indicates  the  probability  of  a  correct  response  (Lord,  1980). 

Again,  assume  that  Q  has  been  estimated  so  that  the  sampling  density  is 
N  ( u  ,  E  =  CC' ).  Since  there  are  three  parameters  in  this  model,  minimization  and 
maximization  of  F  (  x  ;  £  )  must  be  with  respect  to  a  two-dimensional  angle  £,  and  all  of 

*  A 

the  derivatives  are  more  complex.  Nevertheless,  L  (  xQ;  0  )  is  obtained  at  the  point  at 
which  3f/3^  -  0  ,  where 


Dp/ 3^  ■  (1  -  6j)  PQ  3u/3$  +  Q38s/3$ 

and 

3u/3<|!  m  ( x  -  0  2 )  3  9  1  /  3j£  -  0 1 3  8  2 / 3£  . 

As  before, 

A 

0  -  Cz  +  J3, 

SO 

{30/3$]  -  C  [3z/3£] 

and  since 

z'  =  [  h  sin$  2  cos  $i  h  sin  <t>  2  sin$x  hcos$2  1 


it  follows  that 


[ 3*./3$  1  "  = 

in  all  of  which  h 

With  the  matrix  of  second  derivatives  (which  is  entirely  too  big  and  complex  to  reproduce 
here),  the  function  F  (  x  ;  $ )  can  be  minimized  over  £  to  give  L  (  x_;  £  ),  and 
maximized  to  give  U  (  xQ5  0  ).  The  minimization  problem  is  fairly  ill-conditioned  for 
some  values  xQ,  so  both  a  conditioned  Newton-Raphson  algorithm  and  multiple  starting 
values  must  be  used  to  locate  the  minimum. 

A 

Plotting  L  (  x;  0  )  can  be  accomplished  using  the  following  system.  Start  at  the 

A 

leftmost  xQ  to  be  plotted,  and  successively  (iteratively)  compute  L  (  x;  £  )  as  the 
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minimum  F  (  x;  £  )  for  each  value  of  xQ  to  be  plotted,  always  using  the  value  of  £  for  the 
previous  (left-adjacent)  x0  as  the  starting  value.  When  the  rightmost  xQ  is  reached, 
reverse  the  procedure  and  re-compute  each  point,  starting  from  £  for  the  right-adjacent 
x0.  In  the  flattest  regions,  the  results  for  the  two  passes  sometimes  differ.  Since  the 
process  is  one  of  minimization,  to  get  L  (  x_;  £  ),  retain  the  lower  value.  The  values  for 

/V  " 

U  (  x0;  e  )  are  computed  with  the  same  algorithm,  with  the  signs  reversed. 

II.  N-line  Plots 

While  plots  of  the  (  1  -  a  )-envelope  define  a  (  1  -  a  )-confidence  interval  for  a  fitted 
curve,  they  do  not  graphically  display  the  density  of  curves  within  the  envelope.  For  that 
purpose,  there  are  N-line  plots,  which  are  plots  which  (approximately)  fill  the  ( 1  -  a  )- 
envelope  with  curves,  in  proportions  roughly  corresponding  to  their  likelihood.  N-line 
plots  consist  of  plots  of  N  lines  drawn  from  parameters  randomly  drawn  from  the  sampling 
distribution  for  the  parameters.  N-line  plots  are  easiest  to  describe  by  a  series  of 
examples. 

A  One-Parameter  Example:  25-Line  Plots 

An  extremely  simple  variety  of  logistic  regression  is  sometimes  used  in  mental  test 
theory;  it  makes  use  of  a  model  of  the  form 

F  (  x;  0)  =  1  /  [1+  exp(-u)  1, 

u=  x  -0 

in  which  0  is  a  location  parameter,  and  the  only  estimated  parameter  in  the  model.  Since 
this  one-parameter  function  is  monotonic  in  its  parameter,  the  computation  of  the 
(  1  -  u  )-envelope  for  the  fitted  curve  is  trivial,  as  mentioned  above:  if  0  is  the  estimate 
of  o  ,  with  a  standard  error  of  o,  then  the  upper  boundary  for  the  95%-envelope  is 

A  A 

U  (  x;  o)-F(x;e  -1.96  a);  and  the  analogous  lower  boundary  for  the  envelope  is 

A  A 

L  (  x;  e  )  =  F  (  x;  0  +  1.96  a).  A  typical  95%  envelope  for  such  a  function  is  plotted  in 
Figure  4,  with  the  modal  (fitted)  curve  in  the  center  of  the  envelope.  The  envelope  plot 
describes,  with  its  boundaries,  the  central  95%  confidence  interval  for  the  fitted  curve. 


One-parameter  model;  a=1.0,  b=0.0,  c=0.0,  n=100,  95%  confidence 
envelope.  In  this  and  in  subsequent  figures  the  letters  a,  b,  and 
c  indicate  the  parameters  of  a  logistic  item  response  model  (Lord, 
1980),  and  so  correspond  to  the  vector  £  discussed  in  the  text. 
Parameter  a  is  related  to  the  steepness  of  the  curve  at  its  steepest 
point,  b  indicates  the  position  of  the  curve  along  the  X  axis,  and  c 
is  the  lower  asymptote. 


But  can  the  density  for  the  fitted  curve  be  "filled  in"  a  bit?  Figure  5,  following  the 
assumption  that  the  density  for  0  is  N  (  0 ,  o 2  ),  shows  plots  of  the  curves  corresponding 
to  25  random  deviates  from  a  normal  density  with  mean  3  and  variance  a 2  .  Note  that 
such  randomly  sampled  curves  are  dense  near  the  curve  corresponding  to  3  and  fill  the 
95%-envelope  in  Figure  4.  Thus,  a  25-line  plot  is  a  "filled-out"  envelope  plot,  even  though 
the  density  of  lines  is  less  near  the  edges. 

The  envelope  described  by  the  upper  and  lower  extreme  curves  of  the  25-line  plot  is,  in 
expectation,  precisely  equal  to  the  true  95%  envelope.  This  is  because  the  distance 
between  the  upper  and  lower  boundaries  of  the  95%-envelope  plot  is  determined  by 

/V  A 

plotting  F  (  x;  0  +  1.96  o  )  and  F  (  x;  0  -  1.96  o  )  i.e.,  plotting  F  for  two  values  of 

A 

centered  on  e  ,  which  have  a  range  of  2  ( 1.96  )  o  .  The  random  deviates  which  yield  the 
25-line  plot  are  also  centered  on  0  ,  and  the  expected  value  of  their  range  is  2  (  1.96  )  °. 
(The  expected  value  of  the  range  of  a  sample  of  N  =  25  standard  normal  deviates  is 
3.93  ■  2  (  1.96  )  (Pearson  and  Hartley,  1956).)  Thus,  the  boundaries  of  the  95%-envelope 
and  the  expected  values  of  the  highest  and  lowest  curves  of  a  25-line  plot  are  in  the  same 
place.  The  25-line  plot  gives  a  clearer  graphic  description  of  the  envelope  for  the  curve. 

A  Two-Parameter  Example:  85-Line  Plots 

A  two-parameter  logistic  regression,  in  which 

F  (  x;  0  )  =  1  /  11+  exp  (  -u  )  1  ,  and 

U  =  01  (  X  -  02) 

provides  more  interesting  N-line  plots,  but  they  are  more  costly  to  make.  The  vector  0  is 
assumed  estimated  with  sampling  density  N  (  0  ,  Y.  ).  If  the  upper  and  lower  extreme 
curves  of  the  N-line  plot  are  to  equal  (in  expectation)  the  boundaries  of  the  95%-envelope, 
the  appropriate  N-line  plot  should  have  as  its  (squared)  range  on  any  line  through  the 
density  (shown  in  Figure  2),  the  Mahalanobis  distance  given  by 

<0  -  0)'  l'1  (0-0)  -  X2(1_a)  (2)  . 

that  is,  on  any  line  through  the  two-dimensional  space  in  Figure  2,  the  expected  range  of 
the  projections  of  the  random  deviates  of  the  N-line  plot  should  be  the  same  as  the 
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distance  between  the  two  boundaries  of  the  HDR  on  that  line.  Every  value  of  x  has  as  its 
upper  and  lower  (  1  -  a  )-envelope  boundaries  the  parameters  at  the  intersections  of  the 
HDR  boundary  with  some  line  in  S. -space.  So  the  requirement  that  the  expected  value  of 
the  range  of  the  projections  of  the  random  sample  on  any  such  line  be  equal  to  the  HDR 
boundary  difference  gives  the  desired  result:  the  expected  location  of  the  upper  and 
lower  N-liae  curves  are  along  the  upper  and  lower  ( 1  -  a  )-envelope  boundaries. 

For  the  two-dimensional  case,  N  (regrettably)  is  85.  This  is  determined  by  noting  that  the 
0  -space  of  Figure  2  is  a  projection  of  the  z -space  of  Figure  3,  and  the  95%HDR 
boundary  of  Figure  3  has  radius  equal  to-^jx2  ^  g5  ^  (  2  )  =  "^/iTo  =  2.45.  Thus  one  makes 
N-line  plots  from  the  two-space  by  sampling  standard  normal  deviates  in  the  orthonormal 
space  (from  the  density  in  Figure  3),  transforming  them  to  have  the  9.  -density  in  Figure  2, 
and  plotting  the  resulting  curves.  So  the  expected  value  of  the  range  of  the  standard 
normal  z's  sampled  must  be  2  (  2.45  ),  to  give  the  N-line  plot  the  required  breadth. 
Pearson  and  Hartley's  (1956)  Table  27  indicates  that  the  required  sample  size  is  N  =  85. 
Figure  6  shows  an  illustrative  85-line  plot;  Figure  7  is  the  corresponding  95%-envelope 
plot.  Note  that  the  85-line  plot  shows  clearly  how  the  odd  shape  of  the  95%-envelope 
arises  from  the  distribution  of  curves.  Flat  curves  make  up  U  (  x;  1  )  on  the  left  and 

A  A  A 

L  (x;  o  )  on  the  right,  whereas  steep  curves  make  up  L  (x;  0  )  on  the  left  and  U  (x;  )  on 

the  right. 

Eighty-five  lines  can  be  tedious  to  plot.  Twenty-five  line  plots  are  equivalent  (in 
expectation)  to  only  85%-envelopes,  but,  as  Figure  8  shows,  give  much  of  the  detail 
available  in  85-line  plots. 

Lots-of-Line  Plots 

By  an  argument  similar  to  that  of  the  preceding  section,  N-line  plots  for  three-parameter 
models  require  N  =  235  to  match  (in  expectation)  95%-envelope  plots.  Figure  9  shows  a 
235-line  plot  for  the  three-parameter  logistic  model  described  above,  and  Figure  10  is  the 
95%-envelope  plot.  Two  hundred  thirty-five  lines  take  a  plotter  a  long  time.  (The  width 
of  a  25-line  plot  is  equivalent  (in  expectation)  to  a  72%-envelope.) 


Figure  9.  Three-parameter  model;  a-1.5,  b=1.0, 
c=0.1,  n=l,000,  235-line  plot. 


N-line  plots  become  cumbersome  as  dimensionality  increases.  Some  alternatives  to  95%- 
envelope -equivalent  N-line  plots  may  be  suggested.  The  simplest  is  to  combine  the 
( 1  -  a  ) -envelope  itself  (computed  as  in  Section  I)  with  a  small  number  of  randomly 
sampled  curves  to  give  some  idea  about  the  structure.  Another  possibility  is  to  abandon 
the  dependence  of  N-line  plots  on  extreme-value  statistics.  One  could  plot  a  small 
number  of  lines,  and  then,  for  each  value  of  x,  estimate  the  standard  deviation  of 
F  ( x;  9  ).  That  estimate  could  be  multiplied  by  the  factor  appropriate  to  the 
dimensionality  of  the  problem  (2.45  for  two,  2.8  for  three,  and  so  on)  and  added  to  (and 
subtracted  from)  the  modal  curve  to  give  projections  of  the  (  1  -  a  ) ^envelope.  The 
standard  deviation  could  be  estimated  robustly,  to  depart  indeed  from  the  dependence  on 
extreme-value  statistics  in  the  methods  above. 

On  the  other  hand,  one  could  (less  robustly,  but  more  quickly)  depend  more  on  extreme 
value  statistics,  using  half-N-line  plots.  To  make  a  half-N-line  plot,  plot  the  number  of 
randomly  sampled  curves  needed  to  have  the  expected  range  equal  to  half  the,  desired 
range  (0.98  for  one  dimension,  1.23  for  two,  1.4  for  three,  and  so  on;  N's  =  4,  6,  8. . .).  The 
data  analyst  then  must  visually  multiply  the  width  of  the  plot  by  two  to  see  the  desired 
result,  which,  with  such  small  N's,  would  be  unreliable.  Still,  it  would  be  quick. 

Conclusion 

The  methodology  derived  and  illustrated  here  provides  an  operational  connection  between 
the  stochastic  variation  in  the  parameter  space  and  the  observed  variation  in  the  function 
space.  This  explicit  connection  provides  a  mechanism  for  the  establishment  of  confidence 
bounds  around  non-linear  functions.  Such  bounds  were  not  available  with  previous 
techniques. 

Earlier  work  (Thissen  &  Wainer,  1982)  showed  that  the  standard  errors  of  item  parameters 
for  commonly  used  models  were  so  large  as  to  render  use  of  the  models  impractical  in 
some  cases  even  with  moderately  large  sample  sizes.  The  techniques  herein  developed 
can  be  used  to  determine  the  consequences  of  these  large  standard  errors  on  the 
estimation  of  the  overall  response  function.  Such  measures  are  relevant  for  many 
practical  applications  of  item  response  theory  (such  as  computer  adaptive  testing). 

Recent  investigations  (Lord  and  Stocking,  personal  communication)  have  shown  that  the 
response  surface  being  explored  for  the  three-parameter  logistic  function  boundaries  is 
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poorly  behaved,  with  local  extrema  being  encountered  more  often  than  seldom. 
Consequently  the  bounds  based  on  the  methods  of  this  paper  may  be  too  narrow.  This 
limitation  is  not  too  serious,  for  one  typically  finds  that  the  error  bounds  are  often  too 
broad  for  the  delicate  decisions  required.  Thus  if  the  bounds  are  perhaps  narrower  than 
they  should  be  and  are  still  too  broad,  this  limitation  is  unlikely  to  be  critical.  We  have 
not  found  these  same  local  extrema  problems  with  the  one  or  two  parameter  models. 
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